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AU 1812 



RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:18:21 

INPUT SET: S6822.raw 



SEQUENCE LISTING 

(1) General Information: 

(i) APPLICANT: Breece, Tim 

Hayenga, Kirk 
Rinderknecht , Ernst 
Vandlen, Richard 
Yansura, Daniel 

(ii) TITLE OF INVENTION: PROCESS FOR PRODUCING RELAXIN 

(iii) NUMBER OF SEQUENCES: 40 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Mr. Walter H. Dreger 

(B) STREET: 4 Embarcadero Center, Suite 3400 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 
-(F) ZIP: 94111 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 21-JUN-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Dreger, Walter H. 

(B) REGISTRATION NUMBER: 24,190 

(C) REFERENCE/DOCKET NUMBER: A-58117/WHD 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 781-1989 

(B) TELEFAX: (415) 398-3249 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PAGE: 2 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:18:23 

INPUT SET: S6822.raw 

52 (ii) MOLECULE TYPE: peptide 

53 

54 

55 

56 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

57 

58 Met Lys Lys Asn lie Ala Phe Leu Leu Lys Arg 

59 1 5 10 
60 

61 (2) INFORMATION FOR SEQ ID NO : 2 : 
62 

63 (i) SEQUENCE CHARACTERISTICS: 

64 (A) LENGTH: 2 9 amino acids 

65 (B) TYPE: amino acid 

66 (C) STRANDEDNESS: single 

67 (D) TOPOLOGY: linear 
68 

69 (ii) MOLECULE TYPE: peptide 

70 

71 

72 

73 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

74 

75 Asp Ser Trp Met Glu Glu Val lie Lys Leu Cys Gly Arg Glu Leu Val 

76 1 5 10 15 
77 

78 Arg Ala Gin lie Ala lie Cys Gly Met Ser Thr Trp Ser 

79 20 25 
80 

81 (2) INFORMATION FOR SEQ ID NO: 3: 
82 

83 (i) SEQUENCE CHARACTERISTICS: 

84 (A) LENGTH: 13 amino acids 

85 (B) TYPE: amino acid 

86 (C) STRANDEDNESS: single 

87 (D) TOPOLOGY: linear 
88 

89 (ii) MOLECULE TYPE: peptide 

90 

91 

92 

93 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

94 

95 Lys Arg Lys Pro Thr Gly Tyr Gly Ser Arg Lys Lys Arg 

96 1 5 10 
97 

98 (2) INFORMATION FOR SEQ ID NO : 4 : 
99 

100 (i) SEQUENCE CHARACTERISTICS: 

101 (A) LENGTH: 24 amino acids 

102 (B) TYPE: amino acid 



/ 
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INPUT SET: S6822.raw 



103 (C) STRANDEDNESS: single 

104 (D) TOPOLOGY: linear 
105 

106 (ii) MOLECULE TYPE: peptide 

107 

108 

109 

110 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

111 

112 Gin Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cys His Val Gly Cys Thr 

113 15 10 15 
114 

115 Lys Arg Ser Leu Ala Arg Phe Cys 

116 20 
117 

118 (2) INFORMATION FOR SEQ ID NO: 5: 
119 

120 (i) SEQUENCE CHARACTERISTICS: 



121 (A) LENGTH: 13 amino acids 

122 (B) TYPE: amino acid 

123 (C) STRANDEDNESS: single 

124 (D) TOPOLOGY: linear 
125 



126 (ii) MOLECULE TYPE: peptide 

127 

128 

129 

130 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

131 

132 Lys Arg Lys Pro Thr Gly Tyr Gly Ser Arg Lys Lys Arg 

133 15 10 
134 

13 5 (2) INFORMATION FOR SEQ ID NO : 6 : 
136 

13 7 (i) SEQUENCE CHARACTERISTICS: 



138 (A) LENGTH: 13 amino acids 

139 (B) TYPE: amino acid 

140 (C) STRANDEDNESS: single 

141 (D) TOPOLOGY: linear 
142 



143 (ii) MOLECULE TYPE: peptide 

144 

145 

146 

147 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

148 

149 Asp Lys Lys Arg Thr Gly Tyr Gly Ser Arg Arg Arg Lys 

150 15 10 
151 

152 (2) INFORMATION FOR SEQ ID NO : 7 : 
153 
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154 (i) SEQUENCE CHARACTERISTICS: 

155 (A) LENGTH: 13 amino acids 

156 (B) TYPE: amino acid 

157 (C) STRANDEDNESS: single 

158 (D) TOPOLOGY: linear 
159 

160 (ii) MOLECULE TYPE: peptide 

161 

162 

163 

164 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

165 

166 Asp Lys Lys Arg Thr Gly Tyr Gly Ser Arg Lys Lys Arg 

167 15 10 
168 

16 9 (2) INFORMATION FOR SEQ ID NO : 8 : 
170 

171 (i) SEQUENCE CHARACTERISTICS: 

172 (A) LENGTH: 13 amino acids 

173 (B) TYPE: amino acid 

174 (C) STRANDEDNESS: single 

175 (D) TOPOLOGY: linear 
176 

177 (ii) MOLECULE TYPE: peptide 

178 

179 

180 

181 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

182 

183 Lys Arg Lys Pro Thr Gly Tyr Gly Ser Arg Arg Arg Lys 

184 1 5 10 
185 

186 (2) INFORMATION FOR SEQ ID NO : 9 : 
187 

188 (i) SEQUENCE CHARACTERISTICS: 

189 (A) LENGTH: 231 base pairs 

190 (B) TYPE: nucleic acid 

191 (C) STRANDEDNESS: single 

192 (D) TOPOLOGY: linear 
193 

194 (ii) MOLECULE TYPE: cDNA 

195 

196 

197 (ix) FEATURE: 

198 (A) NAME/KEY: CDS 

199 (B) LOCATION: 1..231 
200 

201 

202 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

203 

204 ATG AAA AAG AAT ATC GCA TTT CTT CTT AAA CGG GAC TCA TGG 



r 



PAGE: 5 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:18:30 

INPUT SET: S6822.raw 



- - > 



205 

206 
207 
208 
209 
210 
211 
212 
213 
214 
215 
216 
217 
218 
219 
220 
221 
222 
223 
224 
225 
226 
227 
228 
229 
230 
231 
232 
233 
234 
235 
236 
237 
238 
239 
240 
241 
242 
243 
244 
245 
246 
247 
248 
249 
250 
251 
252 
253 
254 
255 



ATG GAG 48 

Met Lys Lys Asn lie Ala Phe Leu Leu Lys Arg Asp Ser Trp Met 
15 10 15 



CfiAj 



AA GTT ATT-^AA 
G ATT GCC 



C GGC CGC GAA TTA GTT CGC GCG 



Glu Vai lie Lys "Leu Cys Gly Arg Glu Leu Val Arg Ala Gin lie 
20 25 30 

JVTT TGC GGC ATG AGQ^ACC TGG AGC AAA AGG AAA CCC ACT 

^GGT TAT GGT ^_144) 

lTe~Cys Gly MeL"1§erTnr Trp Ser Lys Arg Lys Pro Thr Gly Tyr 
35 40 45 

T££-CGA__4AA AAG AGA C AA CTC TAC AGT GCA TTG GCT AAT AAA 
TGC 192' 



Ser Arg Lys Lys ArgGln Leu Tyr ser Ala Leu Ala Asn Lys Cys 
50 55 60 



Glu 



Ala 



Gly 



Cys 




CAT GTT GGT TGT ACC AAA AGA TCT CTT GCT AGA TTT TGC 



CUP- 

-y Cys T 



His Val Gly Cys Thr Lys Arg Ser Leu Ala Arg Phe Cys 
65 70 75 



(2) INFORMATION FOR SEQ ID 




fO? 



Jo 



(i) SEQUENCE CHARACTERISTICS: ^ 

(A) LENGTH: 77 amino acids j, <,L>^ 

(B) TYPE: amino acid . ',(■ 
(D) TOPOLOGY: linear *\^)^\<, \ 

(ii) MOLECULE TYPE: protein •-— ^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 



7 f ^por^ioa 



Met Lys Lys Asn lie Ala Phe Leu Leu Lys Arg Asp Ser Trp Met 
15 10 15 

Glu Val lie Lys Leu Cys Gly Arg Glu Leu Val Arg Ala Gin lie 
20 25 30 

lie Cys Gly Met Ser Thr Trp Ser Lys Arg Lys Pro Thr Gly Tyr 
35 40 45 

Ser Arg Lys Lys Arg Gin Leu Tyr Ser Ala Leu Ala Asn Lys Cys 
50 55 60 

His Val Gly Cys Thr Lys Arg Ser Leu Ala Arg Phe Cys 
65 70 75 



Glu 



Ala 



Gly 



Cys 




*{fAu 



^ — -fLu 
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TIME: 15:18:32 



■ - > 
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INPUT SET: S6822.raw 




(2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 431.. 586 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC 
ATGAAAAATC TCATTGCTGA 60 



°5 s ^ i© ^ 





GTTGTTATTT AAGCTTGCCC AAAAAGAAGA AGAGTCGAAA 
GAACTGTGTG CGCAGGTAGA 120 

AGCTTTGGAG ATTATCGTCA CTGCAATGCT TCGCAATATG 
GCGCAAAATG ACCAACAGCG 180 

GTTGATTGAT CAGGTAGAGG GGGCGCTGTA CGAGGTAAAG 
CCCGATGCCA GCATTCCTGA 240 

CGACGATACG GAGCTGCTGC GCGATTACGT AAAGAAGTTA 
TTGAAGCATC CTCGTCAGTA 300 

AAAAGTTAAT CTTTTCAACA GCTGTCATAA AGTTGTCACG 
GCCGAGACTT ATAGTCGCTT 360 

TGTTTTTATT TTTTAATGTA TTTGTACGCA AGTTCACGTA 
AAAAGGGTAT CTAGAGGTTG 420 

AGGTGATTTT ATG AAA AAG AAT ATC GCA TTT CTT CTT GCA TCT 
ATG TTC 469 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe 
15 10 

GTT TTT TCT ATT GCT ACA AAT GCC TAT GCA GAC TCA TGG ATG 
GAG GAA 517 

Val Phe Ser lie Ala Thr Asn Ala Tyr Ala Asp Ser Trp Met Glu Glu 
15 20 25 

GTT ATT AAA TTA TGC GGC CGC GAA TTG GTA CGC GCG CAA ATA 
GCG ATA 565 
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307 
308 
309 
310 
311 
312 
313 
314 
315 
316 
317 
318 
319 
320 
321 
322 
323 
324 
325 
326 
327 
328 
329 
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331 
332 
333 
334 
335 
336 
337 
338 
339 
340 
341 
342 
343 
344 
345 
346 
347 
348 
349 
350 
351 
352 
353 
354 
355 
356 
357 



Val lie Lys Leu Cys Gly Arg Glu Leu Val Arg Ala Gin lie Ala lie 



30 



35 



40 



45 



TGC GGT ATG AGT ACA TGG AGT TGAAGAA 
593 

Cys Gly Met Ser Thr Trp Ser 
50 



(2) INFORMATION FOR SEQ ID 



NC^3^): 




t! 



-77 ^ 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe Ser 
15 10 15 

lie Ala Thr Asn Ala Tyr Ala Asp Ser Trp Met Glu Glu Val lie Lys 
20 25 30 

Leu Cys Gly Arg Glu Leu Val Arg Ala Gin lie Ala lie Cys Gly Met 
35 40 45 

Ser Thr Trp Ser 
50 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 438.. 1238 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC 



PAGE: 8 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:18:47 

INPUT SET: S6822.raw 



--> 358 ATGAAAAATC TCATTGCTGA 60 

359 

360 GTTGTTATTT AAGCTTGCCC AAAAAGAAGA AGAGTCGAAA 
--> 361 GAACTGTGTG CGCAGGTAGA 120 

362 

363 AGCTTTGGAG ATTATCGTCA CTGCAATGCT TCGCAATATG 
--> 364 GCGCAAAATG ACCAACAGCG 180 

365 

366 GTTGATTCAT CAGGTAGAGG GGGCGCTGTA CGAGGTAAAG 
--> 367 CCCGATGCCA GCATTCCTGA 240 

368 

369 CGACGATACG GAGCTGCTGC GCGATTACGT AAAGAAGTTA 
--> 370 TTGAAGCATC CTCGTCAGTA 300 

371 

372 AAAAGTTAAT CTTTTCAACA GCTGTCATAA AGTTGTCACG 
--> 373 GCCGAGACTT ATAGTCGCTT 360 

374 

375 TGTTTTTATT TTTTAATGTA TTTGTAACTA GTACGCAAGT 
--> 376 TCACGTAAAA AGGGTATCTA 420 

377 

378 GAGGTTGAGG TGATTTT ATG AAA AAG AAT ATC GCA TTT CTT 
--> 379 CTT GCA TCT 470 

380 Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser 

381 15 10 
382 
383 

--> 384 ACT ACA 518 

385 

386 15 20 25 

387 
388 

--> 389 TTC AAG 566 

390 

391 30 35 40 

392 
393 

--> 394 ACT GTT 614 

395 

396 45 50 55 

397 
398 

--> 399 TTT TAC ACA 662 

400 

401 60 65 70 75 

402 
403 

--> 404 GAT GTG AAG 710 

405 

406 80 85 90 

407 

408 CAG ACG TAC TTG GCA CGG GTC TTC TCC TAC CCG GCA GGG AAT 



CTT 


GCA 


TCT 


ATG 


TTC 


GTT 


ACT 


ACA 




Met 


Phe 


Val 


AAT 


ACT 


GTG 


TTC 


AAG 




Asn 


Thr 


Val 






30 


ACA 


ATT 


TTG 


ACT 


GTT 




Thr 


He 


Leu 




45 




CAA 


ATA 


AGC 


TTT 


TAC 


ACA 


Gin 


He 


Ser 


60 






ACA 


GAC 


ACA 


GAT 


GTG 


AAG 


Thr 


Asp 


Thr 


CAG 


ACG 


TAC 
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--> 409 GTG GAG 758 

410 Gin Thr Tyr Leu Ala Arg Val Phe Ser Tyr Pro Ala Gly Asn Val Glu 

411 95 100 105 
412 

413 AGC ACC GGT TCT GCT GGG GAG CCT CTG TAT GAG AAC TCC CCA 

414 GAG TTC 806 

415 Ser Thr Gly Ser Ala Gly Glu Pro Leu Tyr Glu Asn Ser Pro Glu Phe 

416 110 115 120 
417 

418 ACA CCT TAC CTG GAG ACA AAC CTC GGA CAG CCA ACA ATT CAG 
--> 419 AGT TTT 854 

420 Thr Pro Tyr Leu Glu Thr Asn Leu Gly Gin Pro Thr lie Gin Ser Phe 

421 125 130 135 
422 

423 GAA CAG GTG GGA ACA AAA GTG AAT GTG ACC GTA GAA GAT 
--> 424 GAA CGG ACT 902 

425 Glu Gin Val Gly Thr Lys Val Asn Val Thr Val Glu Asp Glu Arg Thr 

426 140 145 150 155 
427 

428 TTA GTC AGA AGG AAC AAC ACT TTC CTA AGC CTC CGG GAT GTT 
--> 429 TTT GGC 950 

430 Leu Val Arg Arg Asn Asn Thr Phe Leu Ser Leu Arg Asp Val Phe Gly 

431 160 165 170 
432 

433 AAG GAC TTA ATT TAT ACA CTT TAT TAT TGG AAA TCT TCA AGT 
--> 434 TCA GGA 998 

435 Lys Asp Leu lie Tyr Thr Leu Tyr Tyr Trp Lys Ser Ser Ser Ser Gly 

436 175 180 185 
437 

438 AAG AAA ACA GCC AAA ACA AAC ACT AAT GAG TTT TTG ATT GAT 
--> 439 GTG GAT 1046 

440 Lys Lys Thr Ala Lys Thr Asn Thr Asn Glu Phe Leu lie Asp Val Asp 

441 190 195 200 
442 

443 AAA GGA GAA AAC TAC TGT TTC AGT GTT CAA GCA GTG ATT CCC 
--> 444 TCC CGA 1094 

445 Lys Gly Glu Asn Tyr Cys Phe Ser Val Gin Ala Val lie Pro Ser Arg 

446 205 210 215 
447 

448 ACA GTT AAC CGG AAG AGT ACA GAC AGC CCG GTA GAG TGT 
--> 449 ATG GGC CAG 1142 

450 Thr Val Asn Arg Lys Ser Thr Asp Ser Pro Val Glu Cys Met Gly Gin 

451 220 225 230 235 
452 

453 GAG AAA GGC CAA TTC AGA GAA ATA TTC TAC ATC ATT GGA GCT 
--> 454 GTG GTA 1190 

455 Glu Lys Gly Gin Phe Arg Glu He Phe Tyr He He Gly Ala Val Val 

456 240 245 250 
457 

458 TTT GTG GTC ATC ATC CTT GTC ATC ATC CTG GCT ATA TCT CTA 
--> 459 CAC TAAAATTCTC 1245 
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460 
461 
462 
463 
464 
465 
466 
467 
468 
469 
470 
471 
472 
473 
474 
475 
476 
477 
478 
479 
480 
481 
482 
483 
484 
485 
486 
487 
488 
489 
490 
491 
492 
493 
494 
495 
496 
497 
498 
499 
500 
501 
502 
503 
504 
505 
506 
507 
508 
509 
510 



Phe Val Val He He Leu Val He He Leu Ala He Ser Leu His 



255 



260 



265 



ATGTTTGACA GCTTATCATC GATAAGCTTT AATGCGGTAG 
TTTATCACAG TTAAATTGCT 1305 

AACGCAGTCA GGCACCGTGT ATGAAATCTA ACAATGCGCT 
CATCGTCATC CTCGGCACCG 1365 

TCACCCTGGA TGCTGTAGGC ATAGGCTTGG TTATGCCGGT 
ACTGCCGGGC CTCTTGCGGG 1425 

ATATCGTCCA TTCCGACAGC ATCGCCAGTC ACTATGGCGT 
GCTCCTAGCG CTATATGCGT 1485 

TGATGCAATT TCTAT 





02 



(2) INFORMATION FOR SEQ ID N^38 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Lys Lys Asn He Ala Phe Leu Leu Ala Ser Met 
15 10 

He Ala Thr Asn Ala Tyr Ala Ser Gly Thr Thr Asn 
20 25 

Tyr Asn Leu Thr Trp Lys Ser Thr Asn Phe Lys Thr 
35 40 



Phe Val Phe Ser 
15 

Thr Val Ala Ala 
30 

He Leu Glu Trp 
45 



Glu Pro Lys Pro Val Asn Gin Val Tyr Thr Val Gin He Ser Thr Lys 
50 55 60 



Ser Gly Asp Trp Lys Ser Lys Cys Phe Tyr Thr Thr 

65 70 75 

Asp Leu Thr Asp Glu He Val Lys Asp Val Lys Gin 

85 90 

Arg Val Phe Ser Tyr Pro Ala Gly Asn Val Glu Ser 

100 105 



Asp Thr Glu Cys 
80 

Thr Tyr Leu Ala 
95 

Thr Gly Ser Ala 
110 



1500 



Gly Glu Pro Leu Tyr Glu Asn Ser Pro Glu Phe Thr Pro Tyr Leu Glu 
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511 115 120 125 

512 

513 Thr Asn Leu Gly Gin Pro Thr lie Gin Ser Phe Glu Gin Val Gly Thr 

514 130 135 140 
515 

516 Lys Val Asn Val Thr Val Glu Asp Glu Arg Thr Leu Val Arg Arg Asn 

517 145 150 155 160 
518 

519 Asn Thr Phe Leu Ser Leu Arg Asp Val Phe Gly Lys Asp Leu lie Tyr 

520 165 170 175 
521 

522 Thr Leu Tyr Tyr Trp Lys Ser Ser Ser Ser Gly Lys Lys Thr Ala Lys 

523 180 185 190 
524 

525 Thr Asn Thr Asn Glu Phe Leu lie Asp Val Asp Lys Gly Glu Asn Tyr 

526 195 200 205 
527 

528 Cys Phe Ser Val Gin Ala Val lie Pro Ser Arg Thr Val Asn Arg Lys 

529 210 215 220 
530 

531 Ser Thr Asp Ser Pro Val Glu Cys Met Gly Gin Glu Lys Gly Gin Phe 

532 225 230 235 240 
533 

534 Arg Glu He Phe Tyr He He Gly Ala Val Val Phe Val Val He He 

535 245 250 255 
536 

537 Leu Val He He Leu Ala He Ser Leu His 

538 260 265 
539 

540 

■-> 541 (2) INFORMATION FOR SEQ ID NO: 12: 

542 

543 (i) SEQUENCE CHARACTERISTICS: 

544 (A) LENGTH: 11 amino acids 

545 (B) TYPE: amino acid 

546 (C) STRANDEDNESS: single 

547 (D) TOPOLOGY: linear 
548 

549 <ii) MOLECULE TYPE: peptide 

550 

551 

552 

553 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

554 

555 Met Lys Lys Asn He Ala Phe Leu Leu Arg Lys 

556 1 5 10 
557 

558 (2) INFORMATION FOR SEQ ID NO:13: 
559 

560 (i) SEQUENCE CHARACTERISTICS: 

561 (A) LENGTH: 11 amino acids 
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562 (B) TYPE: amino acid 

563 (C) STRANDEDNESS: single 

564 (D) TOPOLOGY: linear 
565 

566 (ii) MOLECULE TYPE: peptide 

567 

568 

569 

570 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

571 

572 Met Lys Lys Asn lie Ala Phe Leu Leu Arg Arg 

573 1 5 10 
574 

575 (2) INFORMATION FOR SEQ ID NO : 14 : 
576 

577 (i) SEQUENCE CHARACTERISTICS: 

578 (A) LENGTH: 11 amino acids 

579 (B) TYPE: amino acid 

580 (C) STRANDEDNESS: single 

581 (D) TOPOLOGY: linear 
582 

583 (ii) MOLECULE TYPE: peptide 

584 

585 

586 

587 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

588 

589 Met Lys Lys Asn lie Ala Phe Leu Leu Lys Lys 

590 1 5 10 
591 

592 (2) INFORMATION FOR SEQ ID NO: 15: 
593 

594 (i) SEQUENCE CHARACTERISTICS: 

--> 595 (A) LENGTH: 42 base pairs 

596 (B) TYPE: nucleic acid 

597 (C) STRANDEDNESS: both 

598 (D) TOPOLOGY: linear 
599 

600 (ii) MOLECULE TYPE: cDNA 

601 

602 

603 (ix) FEATURE: 

604 (A) NAME/KEY: misc_feature 

605 (B) LOCATION: 5.. 42 

606 -(D) OTHER INFORMATION: /note= "Complementary 

607 double -stranded binding between bases 5 and 42 to 

608 SEQ ID NO: 16. " 
609 

610 

611 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

612 
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613 
614 
615 
616 
617 
618 

--> 619 

620 
621 
622 
623 
624 
625 
626 
627 
628 
629 
630 
631 
632 
633 
634 
635 
636 
637 
638 
639 
640 
641 
642 

--> 643 

644 
645 
646 
647 
648 
649 
650 
651 
652 
653 
654 
655 
656 
657 
658 
659 
660 
661 
662 
663 




RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:19:21 

INPUT SET: S6822.raw 



CTAGAATTAT GAAAAAGAAT ATCGCATTTC TTCTTAAACG GG 

& - 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 4. .41 

(D) OTHER INFORMATION: /note= "Complementary 

double -stranded binding between bases 4 and 41 to 
SEQ ID NO: 15. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
AGTCCCGTTT AAGAAGAAAT GCGATATTCT TTTTCATAAT T 

<5^r 

(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 5. .42 

(D) OTHER INFORMATION: /note= "Complementary 

double -stranded binding between bases 5 and 42 to 
SEQ ID NO: 18. 11 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 9. .41 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 



PAGE: 14 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/080,354 



DATE: 10/27/93 
TIME: 15:19:28 



INPUT SET: S6822.raw 



•-> 



- - > 



- -> 



664 
665 
666 
667 
668 
669 
670 
671 
672 
673 
674 
675 
676 
677 
678 
679 
680 
681 
682 
683 
684 
685 
686 
687 
688 
689 
690 
691 
692 
693 
694 
695 
696 
697 
698 
699 
700 
701 
702 
703 
704 
705 
706 
707 
708 
709 
710 
711 
712 
713 
714 



CTAGAATT ATG T 



GCT ATG CCT CTA TCT AGT AAA CGG G 



Met Phe Pro Ala Met Pro Leu Ser Ser Lys Arg 
1 5 ^ 10 




(2) INFORMATION FOR SEQ ID N| 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

Met Phe Pro Ala Met Pro Leu Ser Ser Lys Arg 
15 10 




(2) INFORMATION FOR SEQ ID NO: 18 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION: 4. .41 

(D) OTHER INFORMATION: /note= "Complementary 

double- stranded binding between bases 4 and 41 to 
SEQ ID NO: 17. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO^TT^ 

AGTCCCGTTT ACTAGATAGA GGCATAGCTG GGAACATAAT T 
41 

N<^20^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 





(2) INFORMATION FOR SEQ ID 



PAGE: 15 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:19:35 

INPUT SET: S6822.raw 



- -> 



- - > 



715 
716 
717 
718 
719 
720 
721 
722 
723 
724 
725 
726 
727 
728 
729 
730 
731 
732 
733 
734 
735 
736 
737 
738 
739 
740 
741 
742 
743 
744 
745 
746 
747 
748 
749 
750 
751 
752 
753 
754 
755 
756 
757 
758 
759 
760 
761 
762 
763 
764 
765 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 5 . . 64 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 5 . . 64 

(D) OTHER INFORMATION: /note= "Complementary 

double-stranded binding between bases 5 and 64 to 
SEQ ID NO: 21. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

CGCG CAG ATT GCC ATT TGC GGC ATG AGC ACC TGG AGC AAA 
AGG AAA CCC 49 

Gin lie Ala lie Cys Gly Met Ser Thr Trp Ser Lys Arg Lys Pro 



10 



15 



ACT GGT TAT GGT TCT 
Thr Gly Tyr Gly Ser 
20 



64 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Gin lie Ala lie Cys Gly Met Ser Thr Trp Ser Lys Arg Lys Pro Thr 
15 10 15 



Gly Tyr Gly Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 



PAGE: 16 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:19:42 

INPUT SET: S6822.raw 



766 (C) STRANDEDNESS: both 

767 (D) TOPOLOGY: linear 
768 

769 (ii) MOLECULE TYPE: cDNA 

770 

771 

772 (ix) FEATURE: 

773 (A) NAME /KEY : misc_feature 

774 (B) LOCATION: 3 . . 62 

775 (D) OTHER INFORMATION: /note= "Complementary 

776 double- stranded binding between bases 3 and 62 to 

777 SEQ ID NO: 20. " 
778 

779 

780 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

781 

782 CGAG AAC CAT AACCAGTGGG TTTCCTTTTG CTCCAGGTGC 

783 TCATGCCGCA AATGGCAATC 60 
784 

785 TG 62 
786 

787 (2) INFORMATION FOR SEQ ID NO:23: 

788 

789 (i) SEQUENCE CHARACTERISTICS: 

790 (A) LENGTH: 50 base pairs 

791 (B) TYPE: nucleic acid 

792 (C) STRANDEDNESS: both 

793 (D) TOPOLOGY: linear 
794 

795 (ii) MOLECULE TYPE: cDNA 

796 

797 

798 (ix) FEATURE: 

799 (A) NAME/KEY: misc_feature 

800 (B) LOCATION: 5 . . 50 

801 (D) OTHER INFORMATION: /note= "Complementary 

802 double -stranded binding between bases 5 and 50 

803 with SEQ ID NO: 24." 
804 

805 

806 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

807 

808 GGCCACTCTG TGCGGTGCTG AACTGGTTGA CGCTCTGCAG 

809 TTTGTTTGCG 50 
810 

811 (2) INFORMATION FOR SEQ ID NO: 24: 
812 

813 (i) SEQUENCE CHARACTERISTICS: 

814 (A) LENGTH: 50 base pairs 

815 (B) TYPE: nucleic acid 

816 (C) STRANDEDNESS: both 



PAGE: 17 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:19:48 

INPUT SET: S6822.raw 



817 (D) TOPOLOGY: linear 

818 

819 (ii) MOLECULE TYPE: cDNA 

820 

821 

822 (ix) FEATURE: 

823 (A) NAME /KEY : misc_feature 

824 (B) LOCATION: 6.. 50 

825 (D) OTHER INFORMATION: /note= "Complementary 

826 double-stranded binding between bases 6 and 50 

827 with SEQ ID NO: 23." 
828 

829 

830 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

831 

832 GTCACCGCAA ACAAACTGCA GAGCGTCAAC CAGTTCAGCA 
--> 833 CCGCAAGAGT 50 
834 

835 (2) INFORMATION FOR SEQ ID NO: 25: 
836 

837 (i) SEQUENCE CHARACTERISTICS: 

--> 838 (A) LENGTH: 55 base pairs 

839 (B) TYPE: nucleic acid 

840 (C) STRANDEDNESS : both 

841 (D) TOPOLOGY: linear 
842 

843 (ii) MOLECULE TYPE: cDNA 

844 

845 

846 (ix) FEATURE: 

847 (A) NAME /KEY : misc_feature 

848 (B) LOCATION: 1..55 

849 (D) OTHER INFORMATION: /note= "Complementary 

850 double -stranded binding between bases 1 and 55 

851 with SEQ ID NO: 26." 
852 

853 

854 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

855 

856 GGTCCCGAAA CTCTGTGCGG TGCTGAACTG GTTGACGCTC 
--> 857 TGCAGTTTGT TTGCG 55 
858 

859 (2) INFORMATION FOR SEQ ID NO: 26: 
860 

861 (i) SEQUENCE CHARACTERISTICS: 

--> 862 (A) LENGTH: 64 base pairs 

863 (B) TYPE: nucleic acid 

864 (C) STRANDEDNESS: both 

865 (D) TOPOLOGY: linear 
866 

867 (ii) MOLECULE TYPE: cDNA 



PAGE: 18 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:19:55 

INPUT SET: S6822.raw 



868 
869 

870 (ix) FEATURE: 

871 (A) NAME /KEY : misc_f eature 

872 (B) LOCATION: 6 . . 64 

873 (D) OTHER INFORMATION: /note= "Complementary 

874 double -stranded binding between bases 6 and 64 

875 with SEQ ID NO: 25." 
876 

877 

878 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

879 

880 GTCACCGCAA ACAAACTGCA GAGCGTCAAC CAGTTCAGCA 

881 CCGCACAGAG TTTCGGGACC 60 
882 

883 TGCA 64 
884 

885 (2) INFORMATION FOR SEQ ID NO: 27: 
886 

887 (i) SEQUENCE CHARACTERISTICS: 

888 (A) LENGTH: 84 base pairs 

889 (B) TYPE: nucleic acid 

890 (C) STRANDEDNESS: both 

891 (D) TOPOLOGY: linear 
892 

893 (ii) MOLECULE TYPE: cDNA 

894 

895 

896 (ix) FEATURE: 

897 (A) NAME/KEY: misc_f eature 

898 (B) LOCATION: 5 . . 84 

899 (D) OTHER INFORMATION: /note= "Complementary 

900 double -stranded binding between bases 5 and 84 

901 with SEQ ID NO: 28." 
902 

903 

904 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

905 

906 CTAGAATTAT GATGATTACT CTGCGCAAAC TTCCTCTGGC 

907 GGTTGCCGTC GCAGCGGGCG 60 
908 

909 TAATGTCTGC TCAGGCCATG GCCA 84 
910 

911 (2) INFORMATION FOR SEQ ID NO: 28: 
912 

913 (i) SEQUENCE CHARACTERISTICS: 

914 (A) LENGTH: 84 base pairs 

915 (B) TYPE: nucleic acid 

916 (C) STRANDEDNESS: both 

917 (D) TOPOLOGY: linear 
918 



PAGE: 19 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:20:02 

INPUT SET: S6822.raw 



919 (ii) MOLECULE TYPE: cDNA 

920 

921 

922 (ix) FEATURE: 

923 (A) NAME /KEY : misc_feature 

924 (B) LOCATION: 5 . . 84 

925 (D) OTHER INFORMATION: /note= "Complementary 

926 double- stranded binding between bases 5 and 84 

927 with SEQ ID NO: 27." 
928 

929 

930 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

931 

932 GATCTGGCCA TGGCCTGAGC AGACATTACG CCCGCTGCGA 

933 CGGCAACCGC CAGAGGAAGT 60 
934 

935 TTGCGCAGAG TAATCATCAT AATT 84 
936 

937 (2) INFORMATION FOR SEQ ID NO: 29: 
938 

939 (i) SEQUENCE CHARACTERISTICS: 

940 (A) LENGTH: 52 base pairs 

941 (B) TYPE: nucleic acid 

942 (C) STRANDEDNESS : both 

943 (D) TOPOLOGY: linear 
944 

945 (ii) MOLECULE TYPE: cDNA 

946 

947 

948 (ix) FEATURE: 

94 9 (A) NAME/KEY: misc_feature 

950 (B) LOCATION: 1 . . 52 

951 (D) OTHER INFORMATION: /note= "Complementary 

952 double -stranded binding between bases 1 and 52 

953 with SEQ ID NO: 30." 
954 

955 

956 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

957 

958 CAACTCTACA GTGCATTGGC TAATAAATGT TGCCATGTTG 

959 GTTGTACCAA AA 52 
960 

961 (2) INFORMATION FOR SEQ ID NO: 30: 
962 

963 (i) SEQUENCE CHARACTERISTICS: 

964 (A) LENGTH: 60 base pairs 

965 (B) TYPE: nucleic acid 

966 (C) STRANDEDNESS: both 

967 (D) TOPOLOGY: linear 
968 

969 (ii) MOLECULE TYPE: cDNA 



PAGE: 20 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:20:09 

INPUT SET: S6822.raw 



970 
971 

972 (ix) FEATURE: 

973 (A) NAME/KEY: misc_feature 

974 (B) LOCATION: 5 . . 56 

975 (D) OTHER INFORMATION: /note= "Complementary 

976 double -stranded binding between bases 5 and 56 

977 with SEQ ID NO: 2 9." 
978 

979 

980 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

981 

982 GATCTTTTGG TACAACCAAC ATGGCAACAT TTATTAGCCA 

983 ATGCACTGTA GAGTTGTGCA 60 

984 
985 

986 (2) INFORMATION FOR SEQ ID NO: 31: 
987 

988 (i) SEQUENCE CHARACTERISTICS: 

989 (A) LENGTH: 13 amino acids 

990 (B) TYPE: amino acid 

991 (C) STRANDEDNESS : single 

992 (D) TOPOLOGY: linear 
993 

994 (ii) MOLECULE TYPE: peptide 

995 

996 

997 

998 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

999 

1000 Asp Lys Lys Arg Thr Gly Tyr Gly Ser Arg Arg Arg Lys 

1001 15 10 
1002 

1003 (2) INFORMATION FOR SEQ ID NO: 32: 
1004 

1005 (i) SEQUENCE CHARACTERISTICS: 

1006 (A) LENGTH: 13 amino acids 

1007 (B) TYPE: amino acid 

1008 (C) STRANDEDNESS: single 

1009 (D) TOPOLOGY: linear 
1010 

1011 (ii) MOLECULE TYPE: peptide 

1012 

1013 

1014 

1015 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32- 

1016 

1017 Asp Lys Lys Arg Thr Gly Tyr Gly Ser Arg Lys Lys Arg 

1018 15 10 
1019 

1020 (2) INFORMATION FOR SEQ ID NO: 33: 



PAGE: 21 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:20:16 

INPUT SET: S6822.raw 



- - > 



- -> 



1021 
1022 
1023 
1024 
1025 
1026 
1027 
1028 
1029 
1030 
1031 
1032 
1033 
1034 
1035 
1036 
1037 
1038 
1039 
1040 
1041 
1042 
1043 
1044 
1045 
1046 
1047 
1048 
1049 
1050 
1051 
1052 
1053 
1054 
1055 
1056 
1057 
1058 
1059 
1060 
1061 
1062 
1063 
1064 
1065 
1066 
1067 
1068 
1069 
1070 
1071 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Lys Arg Lys Pro Thr Gly Tyr Gly Ser Arg Arg Arg Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 




P teofifc. PJ 



IPs 



or <Le^r- 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3. .452 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GC GGC CGC GAA TTA GTT CGC GCG CAG ATT GCC ATT TGC GGC 
ATG AGC 47 

Gly Arg Glu Leu Val Arg Ala Gin lie Ala lie Cys Gly Met Ser 





1 




ACC 


TGG 


AGC 


CAG 


ACA 


CCT 


Thr 


Trp 


Ser 


AGA 


CCA 


GTG 


ACA 


GAA 




Arg 


Pro 


Val 


ACC 


ATA 


AAT 


GAG 


CTG 





10 



15 



95 



20 



25 



30 



143 



35 



191 



40 



45 



PAGE: 22 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:20:23 

INPUT SET: S6822.raw 



- - > 



- - > 



1072 
1073 
1074 
1075 
1076 
1077 
1078 
1079 
1080 
1081 
1082 
1083 
1084 
1085 
1086 
1087 
1088 
1089 
1090 
1091 
1092 
1093 
1094 
1095 
1096 
1097 
1098 
1099 
1100 
1101 
1102 
1103 
1104 
1105 
1106 
1107 
1108 
1109 
1110 
1111 
1112 
1113 
1114 
1115 
1116 
1117 
1118 
1119 
1120 
1121 
1122 



Thr lie Asn Met Met Ser Glu Phe Val Ala Asn Leu Pro Gin Glu Leu 
50 55 60 



AAG TTA ACC CTG TCT GAG ATG CAG CCA GCA TTA CCA CAG 
CAA CAA 239 

Lys Leu Thr Leu Ser Glu Met Gin Pro Ala Leu Pro Gin 

65 70 75 

CAT GTA CCT GTA TTA AAA GAT TCC AGT CTT CTC TTT GAA 
TTT AAG 287 

His Val Pro Val Leu Lys Asp Ser Ser Leu Leu Phe Glu 
80 85 90 

AAA CTT ATT CGC AAT AGA CAA AGT GAA GCC GCA GAC AGC 
AGT CCT TCA 335 

Lys Leu lie Arg Asn Arg Gin Ser Glu Ala Ala Asp Ser 
100 105 

GAA TTA AAA TAC TTA GGC TTG GAT ACT CAT TCT CGA AAA 
AGA CAA 383 

Glu Leu Lys Tyr Leu Gly Leu Asp Thr His Ser Arg Lys 
115 120 

CTC TAC AGT GCA TTG GCT AAT AAA TGT TGC CAT GTT GGT 
ACC AAA 431 

Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cys His Val Gly 
130 135 140 

AGA TCT CTT GCT AGA TTT TGC TGAGATGAAG CTAATTGTGC 
ACATCTCGTA 482 

Arg Ser Leu Ala Arg Phe Cys 

145 150 

TAATATTCAC ACATATTCTT AATGACATTT CACTGATGCT 
TCTATCAGGT CAATTCTCAT 542 

GTTTGACAGC TTATCATCGA TAAGCTTTAA TGCGGTAGTT 
TATCACAGTT AAATTGCTAA 602 

CGCAGTCAGG CACCGTGTAT GAAATCTAAC AATGCGCTCA 
TCGTCATCCT CGGCACCGTC 662 

ACCCTGGATG CTGTAGGCAT AGGCTTGGTT ATGCCGGTAC 
TGCCGGGCCT CTTGCGGGAT 722 

ATCGTCCATT CCGACAGCAT CGCCAGTCAC TATGGCGTGC 
TGCTAGCGCT ATATGCGTTG 782 

ATGCAATTTC TATGCGCACC CGTTCTCGGA GCACTGTCCG 
ACCGCTTTGG CCGCCGCCCA 842 



CTA 

Leu Gin Gin 
GAA 

Glu Phe Lys 
95 



Ser Pro Ser 
110 

AAG 

Lys Arg Gin 
125 

TGT 

Cys Thr Lys 



PAGE: 23 RAW SEQUENCE LISTING DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:20:30 

INPUT SET: S6822.raw 



1123 GTCCTGCTCG CTTCGCTACT TGGAGCCACT ATCGACTACG 

1124 CGATCATGGC GACCACACCC 902 
1125 

1126 GTCCTGTGGA TCC 915 

1127 

1128 

112 9 (2) INFORMATION FOR SEQ ID NO: 36: 
1130 

1131 (i) SEQUENCE CHARACTERISTICS: 

1132 (A) LENGTH: 150 amino acids 

1133 (B) TYPE: amino acid 

1134 (D) TOPOLOGY: linear 
1135 

1136 (ii) MOLECULE TYPE: protein 

1137 

1138 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

1139 

1140 Gly Arg Glu Leu Val Arg Ala Gin He Ala He Cys Gly Met Ser Thr 

1141 15 10 15 
1142 

1143 Trp Ser Lys Arg Ser Leu Ser Gin Glu Asp Ala Pro Gin Thr Pro Arg 

1144 20 25 30 
1145 

1146 Pro Val Ala Glu He Val Pro Ser Phe He Asn Lys Asp Thr Glu Thr 

1147 35 40 45 
1148 

1149 He Asn Met Met Ser Glu Phe Val Ala Asn Leu Pro Gin Glu Leu Lys 

1150 50 55 60 
1151 

1152 Leu Thr Leu Ser Glu Met Gin Pro Ala Leu Pro Gin Leu Gin Gin His 

1153 65 70 75 80 
1154 

1155 Val Pro Val Leu Lys Asp Ser Ser Leu Leu Phe Glu Glu Phe Lys Lys 

1156 85 90 95 
1157 

1158 Leu He Arg Asn Arg Gin Ser Glu Ala Ala Asp Ser Ser Pro Ser Glu 

1159 100 105 110 
1160 

1161 Leu Lys Tyr Leu Gly Leu Asp Thr His Ser Arg Lys Lys Arg Gin Leu 

1162 115 120 125 
1163 

1164 Tyr Ser Ala Leu Ala Asn Lys Cys Cys His Val Gly Cys Thr Lys Arg 

1165 130 135 140 
1166 

1167 Ser Leu Ala Arg Phe Cys 

1168 145 150 
1169 

1170 (2) INFORMATION FOR SEQ ID NO: 39: 

1171 

1172 (i) SEQUENCE CHARACTERISTICS: 

1173 (A) LENGTH: 301 base pairs 



PAGE: 24 RAW SEQUENCE LISTING DATE: 10/27/93 
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- -> 



- - > 



1174 
1175 
1176 
1177 
1178 
1179 
1180 
1181 
1182 
1183 
1184 
1185 
1186 
1187 
1188 
1189 
1190 
1191 
1192 
1193 
1194 
1195 
1196 
1197 
1198 
1199 
1200 
1201 
1202 
1203 
1204 
1205 
1206 
1207 
1208 
1209 
1210 
1211 
1212 
1213 
1214 
1215 
1216 
1217 
1218 
1219 
1220 
1221 
1222 
1223 
1224 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 7. .2 97 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AAGCTT ATG AAA TCT AAC AAT GCG CTC ATC GTC ATC CTC GGC 
ACC GTC 48 

Met Lys Ser Asn Asn Ala Leu lie Val lie Leu Gly Thr Val 
15 10 

ACC CTG GAT GCT GTA GGC ATA GGC TTG GTT ATG CCG GTA CTG 
CCG GGC 96 

Thr Leu Asp Ala Val Gly lie Gly Leu Val Met Pro Val Leu Pro Gly 
15 20 25 30 

CTC TTG CGG GAT ATC GTC CAT TCC GAC AGC ATC GCC AGT CAC 
TAT GGC 144 

Leu Leu Arg Asp lie Val His Ser Asp Ser lie Ala Ser His Tyr Gly 
35 40 45 

GTG CTG CTA GCG CTA TAT GCG TTG ATG CAA TTT CTA TGC GCA 
CCC GTT 192 

Val Leu Leu Ala Leu Tyr Ala Leu Met Gin Phe Leu Cys Ala Pro Val 
50 55 60 

CTC GGA GCA CTG TCC GAC CGC TTT GGC CGC CGC CCA GTC CTG 
CTC GCT 240 

Leu Gly Ala Leu Ser Asp Arg Phe Gly Arg Arg Pro Val Leu Leu Ala 
65 70 75 

TCG CTA CTT GGA GCC ACT ATC GAC TAC GCG ATC ATG GCG ACC 
ACA CCC 288 

Ser Leu Leu Gly Ala Thr lie Asp Tyr Ala lie Met Ala Thr Thr Pro 
80 85 90 

GTC CTG TGG ATCC 
Val Leu Trp 
95 



(2) INFORMATION FOR SEQ ID NO: 40: 



301 
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1225 (i) SEQUENCE CHARACTERISTICS: 

1226 (A) LENGTH: 97 amino acids 

1227 (B) TYPE: amino acid 

1228 (D) TOPOLOGY: linear 
1229 

1230 (ii) MOLECULE TYPE: protein 

1231 

1232 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

1233 

1234 Met Lys Ser Asn Asn Ala Leu lie Val lie Leu Gly Thr Val Thr Leu 

1235 1 5 10 15 
1236 

1237 Asp Ala Val Gly lie Gly Leu Val Met Pro Val Leu Pro Gly Leu Leu 

1238 20 25 30 
1239 

1240 Arg Asp lie Val His Ser Asp Ser lie Ala Ser His Tyr Gly Val Leu 

1241 35 40 45 
1242 

1243 Leu Ala Leu Tyr Ala Leu Met Gin Phe Leu Cys Ala Pro Val Leu Gly 

1244 50 55 60 
1245 

1246 Ala Leu Ser Asp Arg Phe Gly Arg Arg Pro Val Leu Leu Ala Ser Leu 

1247 65 70 75 80 
1248 

1249 Leu Gly Ala Thr He Asp Tyr Ala He Met Ala Thr Thr Pro Val Leu 

1250 85 90 95 
1251 

1252 Trp 

1253 

1254 




PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:20:44 

INPUT SET: S6822.raw 

Line Error Original Text Q^- ^ 

30 Wrong application Serial Number (A) APPLICATION NUMBER: V$J 

189 Entered (231) and Calc. Seq. Length (30) differ (A) LENGTH: 231 base pairs 

205 ft of Sequences for line conflicts w/ running total ATG GAG 48 

210 ft of Sequences for line conflicts w/ running total CAG ATT GCC 96 

215 ft of Sequences for line conflicts w/ running total GGT TAT GGT 144 

220 # of Sequences for line conflicts w/ running total TGT TGC 192 

230 Sequence * thru 33 missing (2) INFORMATION FOR SEQ ID NO:34: 

256 Wrong Sequence Number (2) INFORMATION FOR SEQ ID NO: 10: 

259 Entered (593) and Calc. Seq. Length (158) differ (A) LENGTH: 593 base pairs 

275 ft of Sequences for line conflicts w/ running total ATGAAAAATC TCATTGCTGA 60 

278 ft of Sequences for line conflicts w/ running total GAACTGTGTG CGCAGGTAGA 120 

281 # of Sequences for line conflicts w/ running total GCGCAAAATG ACCAACAGCG 180 

284 ft of Sequences for line conflicts w/ running total CCCGATGCCA GCATTCCTGA 240 

287 ft of Sequences for line conflicts w/ running total TTGAAGCATC CTCGTCAGTA 300 

290 ft of Sequences for line conflicts w/ running total GCCGAGACTT ATAGTCGCTT 360 

293 # of Sequences for line conflicts w/ running total AAAAGGGTAT CTAGAGGTTG 420 

296 ft of Sequences for line conflicts w/ running total ATG TTC 469 

301 # of Sequences for line conflicts w/ running total GAGGAA 517 

306 ft of Sequences for line conflicts w/ running total GCG ATA 565 

316 Sequence 1 1 thru 36 missing (2) INFORMATION FOR SEQ ID NO:37: 

339 Wrong Sequence Number (2) INFORMATION FOR SEQ ID NO: 1 1 : 

342 Entered (1500) and Calc. Seq. Length (359) differ (A) LENGTH: 1500 base pairs 

358 ft of Sequences for line conflicts w/ running total ATGAAAAATC TCATTGCTGA 60 

361 ft of Sequences for line conflicts w/ running total GAACTGTGTG CGCAGGTAGA 120 

364 # of Sequences for line conflicts w/ running total GCGCAAAATG ACCAACAGCG 180 

367 ft of Sequences for line conflicts w/ running total CCCGATGCCA GCATTCCTGA 240 

370 ft of Sequences for line conflicts w/ running total TTGAAGCATC CTCGTCAGTA 300 

373 ft of Sequences for line conflicts w/ running total GCCGAGACTT ATAGTCGCTT 360 

376 ft of Sequences for line conflicts w/ running total TCACGTAAAA AGGGTATCTA 420 

379 ft of Sequences for line conflicts w/ running total CTT GCA TCT 470 

384 ft of Sequences for line conflicts w/ running total ACT AC A 518 

389 ft of Sequences for line conflicts w/ running total TTC AAG 566 

394 ft of Sequences for line conflicts w/ running total ACT GTT 614 

399 ft of Sequences for line conflicts w/ running total TTT TAC ACA 662 

404 ft of Sequences for line conflicts w/ running total GAT GTG AAG 7 10 

409 ft of Sequences for line conflicts w/ running total GTG GAG 758 

414 ft of Sequences for line conflicts w/ running total GAG TTC 806 

419 ft of Sequences for line conflicts w/ running total AGT TTT 854 

424 ft of Sequences for line conflicts w/ running total GAA CGG ACT 902 

429 ft of Sequences for line conflicts w/ running total TTT GGC 950 

434 ft of Sequences for line conflicts w/ running total TCA GGA 998 

439 ft of Sequences for line conflicts w/ running total GTG GAT 1046 

444 ft of Sequences for line conflicts w/ running total TCC CGA 1094 

449 ft of Sequences for line conflicts w/ running total ATG GGC CAG 1 142 

454 ft of Sequences for line conflicts w/ running total GTG GTA 1 190 

459 ft of Sequences for line conflicts w/ running total CAC TAAAATTCTC 1245 

464 ft of Sequences for line conflicts w/ running total TTTATCACAG TTAAATTGCT 1305 

467 ft of Sequences for line conflicts w/ running total CATCGTCATC CTCGGCACCG 1365 
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Line 

470 

473 

478 

541 

595 

619 

643 

671 

685 

688 

709 

712 

735 

744 

761 

764 

783 

787 

790 

809 

814 

833 

838 

857 

862 

881 

888 

907 

914 

933 

940 

959 

964 

983 

1037 

1040 

1056 

1061 

1066 

1071 

1076 

1081 

1086 

1091 

1096 

1101 

1106 

1109 



SEQUENCE VERIFICATION REPORT DATE: 10/27/93 

PATENT APPLICATION US/08/080,354 TIME: 15:20:58 
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Error 

tt of Sequences for line conflicts w/ running total 

ft of Sequences for line conflicts w/ running total 

Sequence 12 thru 37 missing 

Wrong Sequence Number 

Entered (42) and Calc. Seq. Length (0) differ 

Entered (41) and Calc. Seq. Length (0) differ 

Entered (42) and Calc. Seq. Length (0) differ 

Sequence 18 missing 

Wrong Sequence Number 

Entered (41) and Calc. Seq. Length (0) differ 

Sequence 19 missing 

Entered (64) and Calc. Seq. Length (24) differ 

ft of Sequences for line conflicts w/ running total 

Sequence 21 missing 

Wrong Sequence Number 

Entered (62) and Calc. Seq. Length (22) differ 

ft of Sequences for line conflicts w/ running total 

Sequence 22 missing 

Entered (50) and Calc. Seq. Length (10) differ 
ft of Sequences for line conflicts w/ running total 
Entered (50) and Calc. Seq. Length (10) differ 
ft of Sequences for line conflicts w/ running total 
Entered (55) and Calc. Seq. Length (15) differ 
ft of Sequences for line conflicts w/ running total 
Entered (64) and Calc. Seq. Length (24) differ 
ft of Sequences for line conflicts w/ running total 
Entered (84) and Calc. Seq. Length (44) differ 
ft of Sequences for line conflicts w/ running total 
Entered (84) and Calc. Seq. Length (44) differ 
ft of Sequences for line conflicts w/ running total 
Entered (52) and Calc. Seq. Length (12) differ 
ft of Sequences for line conflicts w/ running total 
Entered (60) and Calc. Seq. Length (20) differ 
ft of Sequences for line conflicts w/ running total 
Sequence 34 missing 

Entered (915) and Calc. Seq. Length (223) differ 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 
ft of Sequences for line conflicts w/ running total 



Original Text 

ACTGCCGGGC CTCTTGCGGG 1425 
GCTCCTAGCG CTATATGCGT 1485 
(2) INFORMATION FOR SEQ ID NO:38: 
(2) INFORMATION FOR SEQ ID NO: 12: 
(A) LENGTH: 42 base pairs 
(A) LENGTH: 41 base pairs 
(A) LENGTH: 42 base pairs 
(2) INFORMATION FOR SEQ ID NO: 19: 
(2) INFORMATION FOR SEQ ID NO: 18: 
(A) LENGTH: 41 base pairs 
(2) INFORMATION FOR SEQ ID NO:20: 
(A) LENGTH: 64 base pairs 
AGG AAA CCC 49 
(2) INFORMATION FOR SEQ ID NO:22: 
(2) INFORMATION FOR SEQ ID NO:21: 
(A) LENGTH: 62 base pairs 
TCATGCCGCA AATGGCAATC 60 
(2) INFORMATION FOR SEQ ID NO:23: 
(A) LENGTH: 50 base pairs 
TTTGTTTGCG 50 
(A) LENGTH: 50 base pairs 
CCGCAAGAGT 50 
(A) LENGTH: 55 base pairs 
TGCAGTTTGT TTGCG 55 
(A) LENGTH: 64 base pairs 
CCGCACAGAG TTTCGGGACC 60 
(A) LENGTH: 84 base pairs 
GGTTGCCGTC GCAGCGGGCG 60 
(A) LENGTH: 84 base pairs 
CGGCAACCGC CAGAGGAAGT 60 
(A) LENGTH: 52 base pairs 
GTTGTACCAA AA 52 
(A) LENGTH: 60 base pairs 
ATGCACTGTA GAGTTGTGCA 60 
(2) INFORMATION FOR SEQ ID NO:35: 
(A) LENGTH: 915 base pairs 
ATG AGC 47 
CAG ACA CCT 95 
ACA GAA 143 
GAG CTG 191 
CAA CAA 239 
TTT AAG 287 
AGT CCT TCA 335 
AGA CAA 383 
ACC AAA 431 
ACATCTCGTA 482 
TCTATCAGGT CAATTCTCAT 542 
TATCACAGTT AAATTGCTAA 602 
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Error 


Original Text 




1112 


ft of Sequences for line conflicts w/ running total 


TCGTCATCCT CGGCACCGTC 


662 


1115 


ft of Sequences for line conflicts w/ running total 


TGCCGGGCCT CTTGCGGGAT 


722 


1118 


ft of Sequences for line conflicts w/ running total 


TGCTAGCGCT ATATGCGTTG 


782 


1121 


ft of Sequences for line conflicts w/ running total 


ACCGCTTTGG CCGCCGCCCA 


842 


1124 


ft of Sequences for line conflicts w/ running total 


CGATCATGGC GACCACACCC 


902 


1170 


Sequence 37 thru 38 missing 


(2) INFORMATION FOR SEQ ID NO:39: 


1173 


Entered (301) and Calc. Seq. Length (49) differ 


(A) LENGTH: 301 base pairs 




1189 


ft of Sequences for line conflicts w/ running total 


ACC GTC 48 




1194 


ft of Sequences for line conflicts w/ running total 


CCG GGC 96 




1199 


ft of Sequences for line conflicts w/ running total 


TAT GGC 144 




1204 


ft of Sequences for line conflicts w/ running total 


CCC GTT 192 




1209 


ft of Sequences for line conflicts w/ running total 


CTC GCT 240 




1214 


ft of Sequences for line conflicts w/ running total 


ACA CCC 288 
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